Quantum mechanics as a consequence of discrete interactions 
Quantum mechanics is usually presented starting from a series of postulates about the mathematical
framework. In this work we show that those same postulates can be derived by assuming
that measurements are discrete interactions: that is, that we measure at specific moments in time 
(as opposed to a continuous measurement that spans a long time interval) and that the system is 
in general affected by our measurement. We believe that this way of presenting quantum mechanics 
would make it easier to understand by laying out a more cohesive view of the theory and making it 
resonate more with our physics intuition. 



I. INTRODUCTION 



It is unfortunate that, more than half a century after
quantum mechanics became a core part of our
scientific understanding, it is still surrounded by a cloud
of mystery and perceived as strange and nonintuitive.
It is true that it does predict behavior that is odd and 
counter to our intuition, but does that have to imply we 
are bound to feel like something escapes us? 

Let us compare, for example, with another theory that 
also has strange consequences, such as the concept of 
spacetime, time dilation, length contraction or the equiv- 
alence of mass and energy. Yet, special relativity is al- 
ways presented as natural, in fact necessary. We believe 
that the main difference is that it is presented as coming 
from simple physical postulates, the principle of relativ- 
ity and the invariance of the speed of light, which help 
us make sense of all the other physical consequences. [1] 

Quantum mechanics, with its uncertainty principle,
interference and probabilistic predictions, is usually
derived instead from a set of mathematical postulates.
Nothing tells us why a Hilbert space must be used as the 
phase space, or why observables are associated with op- 
erators: that is the starting point. The mathematical re- 
sults derived from the postulates need to be subsequently 
interpreted physically, with nothing else to connect them 
together but the mathematical framework. Should the
physics not come first? Should the math not be derived
from the physics?

We are left to wonder whether we are missing some
sort of physical postulate that could help us tie together
the theory, that would show us why classical mechanics
is insufficient and why quantum mechanics is necessary.
We believe that if we were able to present quantum
mechanics derived from one or more physical postulates, it
would increase our sense of understanding. What is
understanding if not being able to identify, in the midst of
all that is confusing and misleading, that simple truth
from which all others descend?



II. LOOKING FOR A PHYSICAL POSTULATE

To reach our goal, our postulate has to satisfy the
following four requirements.

First of all, it has to state an obvious physical fact:
something that any person who studied quantum
mechanics would perceive as plain, even uninteresting.^

Second, it has to imply that classical mechanics is
insufficient. It has to tell us why we cannot use it, and,
consequently, where we can.

Third, it has to imply quantum mechanics. We need to
derive the mathematical postulates that are the starting
point of many textbooks:^

The phase space is a complex vector space
where each direction represents a physical
state.

The probability to transition from one
(normalized) state to another during a
measurement is given by the square modulus of
the inner product: $|\langle \phi | \psi \rangle|^2$.

For every measurement there exists a
corresponding linear Hermitian operator. The
only possible measurement values are the
eigenvalues of the operator. The expectation
value of the measurement is given by
$\langle \psi | A | \psi \rangle$. Upon a measurement, the state of
the system will transition to the eigenstate
associated with the measured eigenvalue.

Fourth, and last, it has to imply only quantum
mechanics. We should not, for example, derive a theory
that reduces to or is equivalent to quantum mechanics.
Moreover, since there are currently many interpretations,
we need to be compatible with all of them. And since
quantum mechanics does not really give answers to such
questions as realism or locality, we do not expect to give
any either.

In short, there should not be anything revolutionary
about either the principle, or any of the items in the
derivation. The novelty should mainly lie in how things
are presented, and how they resonate better with our
physics intuition.



^ With the theory well established for more than half a century,
it is unlikely that some important point was missed, and even more
unlikely that the author of this paper would be the one to find
it!

^ For brevity, we omit some more advanced cases, such as the phase
space of a composite system or the case where more than one eigenstate
is associated with the same eigenvalue. In addition, we are not
considering the Schrödinger equation a postulate, in the same
way that in special relativity energy-momentum conservation is
not a postulate.



III. ON DESCRIBING A SYSTEM 

At the beginning of the 20th century, experimental 
and theoretical advances such as the ones by Thomson,
Rutherford, Planck and Einstein started to show how 
both matter and the fields that describe its interactions 
are made of discrete elements. All physical processes are 
reduced to interactions among these particles. 

Since measurements are physical processes, it follows 
that they are also interactions. To measure a property of 
an electron we will need to make it interact with another 
particle, for example a photon. To measure a property of 
a photon we will need to make it interact with a charged 
particle, for example an electron. We have to assume, 
then, that at the fundamental level measurements are 
discrete interactions. By interactions we mean that the 
system we are studying is going to be affected: more 
specifically, that some future measurement will change 
because of our current measurement. By discrete we
mean that they happen at a specific instant. Two mea- 
surements, therefore, will be separated from one another 
and need to be ordered, which means we will not be able 
to describe the evolution of the system during a measure- 
ment (as it would imply a measurement, an interaction, 
during another measurement, another interaction): we 
will only be able to describe the system before and after 
the measurement. In addition, the fact that measurements
are discrete allows us to study them independently
of the evolution of the system we are studying, as no 
evolution will happen during an instantaneous measure- 
ment. We can simply turn our attention to the effect of 
measurement, without studying what happens between 
measurements and within a measurement. 

The fact that measurements are discrete interactions 
will be our physical postulate from which we will de- 
rive quantum mechanics. We believe this to be a trivial 
enough fact to satisfy the first requirement. 

Next on our list is to show how this postulate is incom- 
patible with classical mechanics, so let us review some of 
its aspects. In classical mechanics we define the state of 
the system we are studying by a set of values that are the
result of a few measurements that represent the smallest 
set from which all the other possible measurements can 
be calculated. In many cases, for example, position and 
momentum represent the state, and other measurements,
such as energy, can be derived from them. These values
will change in time, depending on the external forces that 
act on the system, creating a trajectory that we usually 
describe using a continuous and differentiable function.

There is an important underlying assumption that al- 
lows us to describe a system this way. The fact that 
we are not required to describe what and when we mea- 
sure means that we can always conduct a measurement 
that does not significantly interact with the system. Even
when we do describe the effects of a measurement, the 
fact that we can fully describe the measurement itself 
means that, at least in principle, we assume there exists 
another measurement that would not disturb the system 
and that would allow us to get a full picture. In classical
mechanics, then, we are assuming that measurements
are observations. By observation we do not mean what 
is usually meant in quantum mechanics: we simply mean 
the intuitive sense of looking at a system without chang- 
ing it, without interacting.^ 

In many cases we can disregard the effects of the mea- 
surement: if we throw a ball and follow its trajectory, the 
ball will be interacting with photons whose interactions 
are not going to significantly change its motion. In this 
case we can assume that our measurements are observa- 
tions, therefore we can use classical mechanics to describe 
the system. But if we are trying to describe the motion
of an electron, instead, we cannot simply disregard the 
effects of the photon interactions: we have to assume that 
measurements are discrete interactions. 

But what exactly changes when we assume that measurements
are discrete interactions? We explore this
question through a thought experiment. 



IV. A THOUGHT EXPERIMENT 

Imagine we have a system in an unknown state, and 
we need to conduct a few measurements to determine its 
physical state. We assume we conduct one experiment at
a time and that nothing is interacting with this system
between our measurements. We will not even assume
what those quantities are: we simply label them with the
first six letters of the alphabet.

We will start by measuring A, then B, C, D, E and
finally F. If we assume measurements are observations,
we can go through all measurements and determine
the full state. In fact, we can even repeat those measurements,
just to be sure. We can do this because nothing
changes the state of the system, so all those values are 
guaranteed to be maintained. 



^ Since this is the critical point, we believe the words observer and
observable are misleading in the context of quantum mechanics,
and that is why we will avoid them. For our purposes, observation
is a measurement where the interaction can be neglected,
and that does not exist in quantum mechanics. 
FIG. 1: If measurements are observations, we can measure all
values one after the other multiple times. 



If we assume measurements are discrete interactions, 
though, things will be different. We will measure A with- 
out a problem, but already with B it could happen that 
the actual measurement modified the value for A. To be 
sure we need to measure A again. We can assume that B 
did not affect A, but, at some point, some measurement 
will affect some other measurement: the very definition 
of interaction implies that a future measurement is going 
to change. For example, let us say we measured A to 
be 7, and then we proceed to measure D to be 3. When 
we go back to measure A again, though, we do not see 
7 anymore, because the D measurement changed it, and 
we now measure 6. 




FIG. 2: If measurements are interactions, at some point one 
measurement will invalidate a previous one. 



Discrete interactions imply that, in general, some mea- 
surements are going to affect the outcome of some other 
measurements. We call these incompatible. Measuring 
one essentially invalidates the previous measurement: in 
our example, when we measured D, we needed to throw 
away the previous measurement of A because it was no 
longer valid. This means we need to choose what we want 
to measure carefully, since measuring one thing means 
that we will not be able to measure the other. It also
means that the order in which we measure different quan- 
tities affects the results. 

Since incompatible measurements cannot be known at 
the same time, we can only define the state up to a full 
set of compatible measurements (where we define full as 
a set in which it is not possible to add a new compatible 
measurement of which the outcome cannot already be 
predicted by the measurements already in the set). Those
states are the only ones we will be able to distinguish
experimentally. Note that this definition also holds for 
classical mechanics: the only difference is that, in that 
case, we can include all measurements since they will all 
be compatible with each other. 

Let us now suppose, then, we know the state of a sys- 
tem, and we perform a measurement: what will our pre- 
dictions be? If the measurement is compatible with mea- 
surements that define the state, we will be able to predict 
the value exactly. But what happens if the measurement 
is incompatible? The outcome is not part of the state 
and cannot be calculated from it. Therefore a precise 
prediction cannot be made. The best that we can hope 
for is a statistical prediction. 

To sum up, discrete interactions imply that there are 
incompatible measurements, that the state will not con- 
tain the results for all possible measurements, but only 
for a compatible set, and that predictions will be, in gen- 
eral, probabilistic. It should be clear now how discrete 
interactions are not compatible with classical mechanics, 
as they force us to redefine our concept of state. And it 
should also be evident how we have already found many 
of the core characteristics of quantum mechanics. 



V. ON INTERPRETATIONS 

We want to stress that we are not going to make any 
further assumptions regarding what the state represents 
and what happens during the interaction. 

To be specific, we have not said whether the state 
also represents some other physical reality as in many 
interpretations of quantum mechanics. We have not said 
whether the world is inherently probabilistic and/or God 
plays dice (as in the Copenhagen interpretation): we only
stated that we have to play dice since our prediction will
be probabilistic. We have not said whether it is or it
is not possible to extend the state with hidden variables
that would make the measurement deterministic, even
though they themselves could not be measured (as in
any hidden variable interpretation, including Bohm-de
Broglie). We have not said whether many worlds,
one for each possible measurement, stem from each
interaction (as in the many-worlds interpretation). On
all of these issues we remain agnostic: they may better 
describe how and why we have discrete interactions, but 
we do not actually need anything so specific to proceed. 

For us the state, as we already said, just describes all 
the values that can be measured experimentally (again, 
this is also true for classical mechanics, with the differ- 
ence that in that case more can be known). Strictly 
speaking, though, the state is at least that: anything 
could be added to make the overall picture more precise 
in some way. But we stop at this common ground. 
VI. ON MEASUREMENTS

Let us see what we can know for sure about measure- 
ments. 

For each of them, we will start in an initial state and
end in a final state. As we said, we will not be able to 
fully determine the final state from the initial state, so 
we will have a probability distribution. 

Our measurements, though, need to be repeatable.^
That is, if we just measured that A equals 7, we need 
to be able to re-measure 7 with certainty, provided no 
other interactions changed the state. This means that if 
we start from one of the possible final states for a mea- 
surement, it will not transition to any other state. Or in 
other words, the probability to transition to itself will be 
1, while the probability to transition to any other of the 
final states will need to be 0. 

Another thing that we can say is that in order for two 
measurements to be compatible, they will need to share 
the set of final states. This is necessary to be able to 
have repeatable measurements of both quantities. 

One thing that should also be apparent now is that 
measurements are actually performed on the final state, 
not on the initial one. When we measure that A equals 
7, we cannot really say that A was 7 before we made the 
measurement. It is only after the transition to the final 
state that we are sure that A is indeed 7, and that we 
can repeat the measurement as many times as we want and
keep obtaining 7. Before the measurement, we do not 
really know... It could have been 7. Or it could have 
been 8 and our measurement changed it to 7. It could 
be that it was not even defined. The point is: we cannot 
really tell. 

This might seem a subtle point, but it is actually 
changing the picture quite a bit. It means that when we 
are studying the state evolution, we are, strictly speak- 
ing, studying the evolution of our measurements, and not 
the evolution of the properties of the system we are inter- 
acting with. If measurements are observations, the two 
necessarily coincide; but with discrete interactions they 
do not. We cannot tell a priori how much of a discrepancy 
there will be. It is probably going to change from case 
to case, but in general we should be very careful when 
interpreting our measurement predictions as describing 
what the system is doing before we interacted with it. 

From this discussion, it seems we will need to be able 
to keep track of two things. The first is the probability 
to transition from one state to the other during measurement.
The second is the set of all the possible outcomes
of each measurement.



^ One might argue whether repeatability constitutes a separate
principle. We note that a non-repeatable measurement would
mean a measurement on an isolated system that returns different
values every time. Such a measurement would be meaningless,
because we would not even be able to ascribe it to any particular
system. Therefore we believe that measurement itself implies
repeatability, or it would not be a measurement.



VII. THE PROBLEM WITH CLASSICAL 
PROBABILITY 

When we ask "what is the probability of transitioning 
to this particular final state knowing that I start from 
this particular initial state?" we are posing a question 
that can be described by conditional probabilities. It is 
natural to ask whether we can use classical probability 
to describe them. Let us study a simple case to try to 
get an answer. 

We consider two measurements, A and B, which are to 
be maximally incompatible. That is, when we measure 
one, we do not know anything about the second. For 
simplicity, we assume that both measurements have two
outcomes: $a^+$, $a^-$ and $b^+$, $b^-$.

If we consider the event $a^+$, we require that the conditional
probabilities are the following: $P(a^+|a^+) = 1$ and
$P(a^-|a^+) = 0$, which is a requirement for the measurement
to be reproducible (we always have to measure $a^+$
and never $a^-$); $P(b^+|a^+) = 1/2$ and $P(b^-|a^+) = 1/2$,
which follows from the measurements being maximally
incompatible (we have an equal chance of getting any B
outcome after having measured A). For all four events
we will get similar numbers, as shown in Table I.



Given event    P(a+|given)    P(a-|given)    P(b+|given)    P(b-|given)
a+             1              0              1/2            1/2
a-             0              1              1/2            1/2
b+             1/2            1/2            1              0
b-             1/2            1/2            0              1



TABLE I: Conditional probabilities for two maximally incom- 
patible measurements. 
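
As a rough illustration of Table I, the sketch below encodes the conditional probabilities as a small function and chains them over a sequence of measurements. The names `conditional` and `chain` are hypothetical, and the chaining assumes that each measurement leaves the system in the final state it reported, so that the next outcome depends only on the previous one.

```python
from fractions import Fraction

OUTCOMES = ("a+", "a-", "b+", "b-")


def conditional(outcome: str, given: str) -> Fraction:
    """P(outcome | given) for two maximally incompatible two-outcome measurements."""
    if outcome[0] == given[0]:
        # Same measurement: it must be repeatable.
        return Fraction(1) if outcome == given else Fraction(0)
    # Other measurement: nothing is known, both outcomes are equally likely.
    return Fraction(1, 2)


def chain(*outcomes: str) -> Fraction:
    """Probability of seeing outcomes[1:] in order, starting from outcomes[0].

    Assumption: each measurement leaves the system in the state it reported.
    """
    prob = Fraction(1)
    for given, outcome in zip(outcomes, outcomes[1:]):
        prob *= conditional(outcome, given)
    return prob


if __name__ == "__main__":
    for given in OUTCOMES:
        print(given, [str(conditional(o, given)) for o in OUTCOMES])
    # Repeating A right away versus measuring B in between.
    print(chain("a+", "a+"))
    print(sum(chain("a+", b, "a+") for b in ("b+", "b-")))
```

Measuring A twice in a row returns the same value with certainty, while inserting a B measurement in between brings the chance of recovering the original A value down to one half.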



Given that $b^+$ and $b^-$ describe probability distributions
in A, we expect to be able to express them in terms
of $a^+$ and $a^-$. We can try to do this by combining $a^+$
and $a^-$ according to classical probability, which means
we describe the outcome in which we get either $a^+$ or
$a^-$ with equal chance. The conditional probabilities that
predict our measurement values given this new outcome
will be the averages of the $a^+$ and $a^-$ cases, making all
of them 1/2. The problem is that the case where everything
is 1/2 is not a state at all: a state is defined by
a complete set of compatible measurements, and a state
where everything is unknown is not complete, since there
is some measurement that can still be taken.

What we actually want to describe when combining $a^+$
and $a^-$ is not, as in classical probability, the case where
we get $a^+$ or $a^-$. We want to describe the case where
a measurement changed the state so that $a^+$ and $a^-$ are
now equally likely. We are essentially asking: if after a
measurement, A is completely unknown, what have we
measured? The answer is: we either measured $b^+$ or $b^-$.
We know that by construction. But notice that we have
no way of obtaining either of them by combining $a^+$ and
$a^-$. If we combine outcomes using classical probability,
we can only increase uncertainty, so if we start from A,
we cannot get B.
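
The argument above can be checked with a few lines of arithmetic. The sketch below takes the rows of Table I, forms classical mixtures of the two A outcomes, and compares them with the B rows. The names `ROWS` and `classical_mix`, and the grid of weights, are illustrative choices and not part of the original derivation.

```python
from fractions import Fraction

HALF = Fraction(1, 2)

# Rows of Table I: probabilities of (a+, a-, b+, b-) given each outcome.
ROWS = {
    "a+": (Fraction(1), Fraction(0), HALF, HALF),
    "a-": (Fraction(0), Fraction(1), HALF, HALF),
    "b+": (HALF, HALF, Fraction(1), Fraction(0)),
    "b-": (HALF, HALF, Fraction(0), Fraction(1)),
}


def classical_mix(weight: Fraction) -> tuple:
    """Classical mixture: a+ with probability `weight`, a- otherwise."""
    return tuple(
        weight * p + (1 - weight) * q
        for p, q in zip(ROWS["a+"], ROWS["a-"])
    )


# The equal mixture makes every entry 1/2: nothing is certain any more.
even_mix = classical_mix(HALF)

# Scan a grid of weights: the B columns stay at 1/2 for every weight,
# so no classical mixture of a+ and a- reproduces the b+ or b- row.
weights = [Fraction(k, 10) for k in range(11)]
reachable = [
    w for w in weights
    if classical_mix(w) in (ROWS["b+"], ROWS["b-"])
]

if __name__ == "__main__":
    print([str(p) for p in even_mix])
    print(reachable)
```

The equal mixture agrees with the $b^+$ row on the A columns but not on the B columns, which is exactly the mismatch described in the text.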

From all of this it follows that we need a different kind
of probability. One that does not describe uncertainty 
coming from lack of measurement, but uncertainty com- 
ing from incompatible measurements. 

VIII. CONSTRUCTING THE PHASE SPACE 

We are now ready to construct the phase space: it will 
be a vector space in which a vector represents a state of 
a system. This is similar to classical mechanics, where 
each point in the phase space represents a state, but this 
is the only thing the two will have in common. 

Since we want to keep track of probabilities, we will 
use the metric of our space to describe them. To be more 
specific, we will want the norm of a vector to have to do 
with the probability of being in the state represented by 
that vector; we will also want the scalar product between 
two vectors to have to do with the conditional probability 
between the two associated states. 

First of all, we should understand what orthogonal directions
represent, since this will also define the basis and
dimensionality of our space. As we said, we want the
scalar product to be connected to the conditional probability.
We also noted that the conditional probability between
two outcomes of the same measurement is either
1, if they are the same outcome, or 0, if they are different.
This means that we will represent different outcomes of
the same measurement by orthogonal directions in our
space. If we have a set of compatible measurements, each
orthogonal state will represent a definite outcome for all
of them, and if the set is complete, the orthogonal states
will describe all the possible orthogonal directions. In
other words, they will be a basis for our space. Any other
vector in that space will then be a probability distribution
across those states.

We need to be more precise on how exactly we rep- 
resent the conditional probabilities in our space. In a 
Euclidean space the norm is given by:^

$\|v\|^2 = x_0^2 + x_1^2 + x_2^2 + \ldots$

That is good for describing lengths, because that is how
orthogonal lengths sum. When we sum probabilities of
orthogonal outcomes, though, we simply sum the probabilities,
so we require that

$P(v|v) = P(x_0|v) + P(x_1|v) + P(x_2|v) + \ldots$



^ For now, we are indeed considering a real vector space, as nothing
has yet told us this is not suitable.



This means that the probability of being in the state 
represented by that vector will be the square of the norm. 
A proper physical state, then, should be represented with 
a vector of norm 1, to represent the fact that we are 
certain the system is in that state. Each component, 
instead, will represent the square root of the probability 
of measuring the state represented by the basis direction:

$x_i = \sqrt{P(x_i|v)} = v \cdot e_i$

The component is simply the scalar product with the unit 
vector along the appropriate axis: the scalar product will 
represent conditional probabilities. 

From the discussion above, we note that any vector 
in the same direction with a norm less than one is still 
going to represent the same physical state, just with a 
different probability of measuring that state. A physical 
state, then, is really a direction in the phase space, so any 
unit vector multiplied by any constant still represents the 
same physical state. As of now, it needs to be between
0 and 1, since it just represents a probability. Later we
will make use of this constant to represent quantities that 
span outside that range. 

To recap, the dimension of the phase space will be the 
number of all possible combinations of the outcomes of a 
complete set of compatible measurements. A unit vector 
in that space will represent a normalized probability dis- 
tribution, and the component along each direction will 
tell us the square root of the probability for that out- 
come. 
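
A minimal numerical sketch of this recap, under the assumption of a real vector space and a made-up distribution over three outcomes: the components are the square roots of the probabilities, the squared norm is 1, and squaring the scalar product with each basis vector recovers the distribution. The helper names are hypothetical.

```python
import math


def state_from_distribution(probs):
    """Components are the square roots of the outcome probabilities."""
    if any(p < 0 for p in probs) or not math.isclose(sum(probs), 1.0):
        raise ValueError("expected a normalized probability distribution")
    return [math.sqrt(p) for p in probs]


def dot(u, v):
    """Real scalar product between two vectors of equal length."""
    return sum(x * y for x, y in zip(u, v))


def basis_vector(i, n):
    """Unit vector along the i-th outcome of the complete compatible set."""
    return [1.0 if k == i else 0.0 for k in range(n)]


probs = [0.5, 0.3, 0.2]  # hypothetical distribution over three outcomes
v = state_from_distribution(probs)

# The squared norm is the total probability.
norm_squared = dot(v, v)

# Squaring each scalar product with a basis vector gives back P(x_i | v).
recovered = [dot(v, basis_vector(i, len(v))) ** 2 for i in range(len(v))]

if __name__ == "__main__":
    print(norm_squared)
    print(recovered)
```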
FIG. 3: Phase space for two incompatible measurements.

Let us go back to the previous example and have a look
at how it all works: $a^+$ and $a^-$ are going to be orthogonal
to each other, and the plane they create will describe the
probability space for A. What about B? We know that
the component for $b^+$ will need to be $\sqrt{1/2}$ on both $a^+$
and $a^-$, which means that $b^+$ is going to be at 45 degrees.
$b^-$ will need to be orthogonal to $b^+$, so it will need to be
at 135 degrees.
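
The geometry just described can be verified directly. The sketch below places the four outcomes at 0, 90, 45 and 135 degrees in a plane and squares the scalar products between them, which should reproduce the conditional probabilities of Table I. The function names are illustrative.

```python
import math


def direction(degrees):
    """Unit vector in the (a+, a-) plane at the given angle from the a+ axis."""
    rad = math.radians(degrees)
    return (math.cos(rad), math.sin(rad))


def transition_probability(u, v):
    """Square of the scalar product between two unit vectors."""
    return (u[0] * v[0] + u[1] * v[1]) ** 2


STATES = {
    "a+": direction(0),
    "a-": direction(90),
    "b+": direction(45),
    "b-": direction(135),
}

# table[(outcome, given)] plays the role of P(outcome | given).
table = {
    (out, given): transition_probability(STATES[out], STATES[given])
    for out in STATES
    for given in STATES
}

if __name__ == "__main__":
    for given in STATES:
        print(given, [round(table[(out, given)], 3) for out in STATES])
```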

Note that if it were classical probability, we would actually
have four orthogonal directions: $a^+b^+$; $a^+b^-$; $a^-b^+$;
$a^-b^-$. This would also be the case if A and B were compatible.
But the fact that they are incompatible makes
them live on the same plane, at 45 degrees from each 
other. 

So, what does it mean to describe the state in this 
space? It means that at any point in time, the state 
vector is pointing toward a linear combination of A and
B. That is the direction that points toward the measurement
that is certain. The orthogonal directions, instead,
represent measurements that we know cannot happen.
All that is in between are measurements that are
going to be uncertain, with the maximum uncertainty at 
45 degrees. This is exactly the kind of uncertainty we 
need to describe. 



IX. ON SPIN 

A careful reader already familiar with quantum me- 
chanics will have probably noticed that the space we con- 
structed in our example is actually the Z-X plane for a 
spin 1/2 system (we are disregarding Y). $a^+$ and $a^-$ represent
$z\uparrow$ and $z\downarrow$, while $b^+$ and $b^-$ represent the $x\uparrow$ and $x\downarrow$
states.

Notice one important thing: $z\uparrow$ and $z\downarrow$ are orthogonal
in this space while they are 180 degrees apart in physical
space; $z\uparrow$ and $x\uparrow$ are at 45 degrees, while they are
90 degrees apart in physical space. This should sound
familiar since whenever we calculate spin probabilities,
half of the physical angle always comes into play. What
we are actually using is the angle in the phase space, not
the physical angle; they just happen to be related. Another
detail: we know that for a spinor we need to make
two full turns in physical space to come back to the same 
state. Again, in the phase space we are actually making 
just one turn, but at half a turn, we are coming back to 
the opposite direction, which represents the same phys- 
ical state. The angle described by the spinor is not an 
angle in physical space, but is an angle in phase space. 
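
The half-angle relation can be made concrete with a short sketch. It assumes the standard spin 1/2 result that the probability of finding "up" along an axis tilted by a physical angle from the axis just measured is the squared cosine of half that angle, and it restricts itself to the Z-X plane as in the text. The function names are hypothetical.

```python
import math


def phase_space_angle(physical_degrees):
    """Angle in phase space: half the angle in physical space."""
    return physical_degrees / 2.0


def prob_same_outcome(physical_degrees):
    """Probability of finding 'up' along an axis tilted by the given physical
    angle from the axis along which 'up' was just measured."""
    half = math.radians(phase_space_angle(physical_degrees))
    return math.cos(half) ** 2


def state_vector(physical_degrees):
    """Unit vector in the (z up, z down) plane for a spin pointing 'up'
    along an axis at the given physical angle from z."""
    half = math.radians(phase_space_angle(physical_degrees))
    return (math.cos(half), math.sin(half))


if __name__ == "__main__":
    for angle in (0, 90, 180, 360, 720):
        print(angle, round(prob_same_outcome(angle), 3), state_vector(angle))
```

After one full physical turn the vector points in the opposite direction, which is the same physical state, and only after two full turns does it return to the same vector.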

It should be clear by now that we are really heading
in the right direction: we are finding quantum
mechanics.



X. GENERALIZING OUR PHASE SPACE 

We saw how our simple space worked for 2 measurements
and 2 outcomes. We should now generalize to the
case of n outcomes. Since we will not be able to visualize 
that through a pictorial representation, we will let the 
mathematical representation guide us. 

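In the simple case, we had the vector components expressed
in A and B linked by a linear transformation (a
45-degree rotation):

$$\begin{pmatrix} b^+ \\ b^- \end{pmatrix} = \begin{pmatrix} \sqrt{2}/2 & \sqrt{2}/2 \\ -\sqrt{2}/2 & \sqrt{2}/2 \end{pmatrix} \begin{pmatrix} a^+ \\ a^- \end{pmatrix} \qquad (1)$$

The square of the components represents the actual
probability distribution. Note that the matrix needs to
be unitary (or one probability distribution would not sum
to one) and the elements represent the scalar product
between the unit vectors of the different bases. In the
case where there are n outcomes, the vector will have n
components, and the matrix will be n x n:

$$\begin{pmatrix} b_1 \\ \vdots \\ b_n \end{pmatrix} = \begin{pmatrix} u_{11} & \cdots & u_{1n} \\ \vdots & \ddots & \vdots \\ u_{n1} & \cdots & u_{nn} \end{pmatrix} \begin{pmatrix} a_1 \\ \vdots \\ a_n \end{pmatrix} \qquad (2)$$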

If we assume that these are still fully incompatible measurements,
all the conditional probabilities between A
states and B states will need to be $1/n$, which means
we need an n x n matrix in which all components have
the same modulus, $\sqrt{1/n}$, but whose determinant
is non-zero. This is in general not possible if the matrix
is made of real numbers.

It is important to understand what this means physi- 
cally. Suppose a matrix of real numbers represented the 
relationship between A and B, and we were given a prob- 
ability distribution of A: we would be able to determine 
the probability distribution of B. But this is not possible 
by construction: a uniform distribution for A could be 
had for any of the different B measurements. We were 
able to use a real matrix in the simple 2x2 case because 
we used a minus sign. 
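
To see the role of the minus sign, the brute-force sketch below counts real n x n matrices whose entries all have modulus $\sqrt{1/n}$ and whose rows are mutually orthogonal unit vectors. It is only an illustration for the small sizes n = 2 and n = 3, and the function names are hypothetical.

```python
import itertools
import math


def is_orthogonal(rows, tol=1e-12):
    """True if the rows are mutually orthogonal unit vectors."""
    for i, r in enumerate(rows):
        for j, s in enumerate(rows):
            expected = 1.0 if i == j else 0.0
            if abs(sum(x * y for x, y in zip(r, s)) - expected) > tol:
                return False
    return True


def count_real_solutions(n):
    """Count n x n real orthogonal matrices whose entries are all +-1/sqrt(n)."""
    entry = 1.0 / math.sqrt(n)
    sign_rows = list(itertools.product((1, -1), repeat=n))
    count = 0
    # Try every assignment of signs to every row.
    for signs in itertools.product(sign_rows, repeat=n):
        rows = [[s * entry for s in row] for row in signs]
        if is_orthogonal(rows):
            count += 1
    return count


if __name__ == "__main__":
    print(count_real_solutions(2))
    print(count_real_solutions(3))
```

For n = 2 the search finds solutions, all of which need at least one minus sign. For n = 3 it finds none, because the scalar product of two rows is a sum of three terms equal to plus or minus 1/3, which can never vanish.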

We can do something similar if we use complex num- 
bers and use the phase in the same way we used the mi- 
nus sign: after all, a minus sign is a phase of 180 degrees. 
This means, though, that we need to extend all of the 
phase space to complex numbers. First of all, we need 
to redefine the scalar product by using, instead of the 
square of the components, the components times their 
complex conjugates: 

$v^* v = x_0^* x_0 + x_1^* x_1 + x_2^* x_2 + \ldots = \langle v | v \rangle$

Also, the probability distribution is a vector of complex 
numbers. But this should not throw us off: it is still a 
probability distribution. In fact, it is two probability
distributions in one. To convince ourselves of this fact, let us
look again at that system of equations: it is n complex
linear equations, that is 2n real linear equations. If we
fix both probability distributions for A and B, we are
fixing the modulus for both distributions, while leaving the
phases undetermined. This leaves us with a system of
2n equations in 2n real unknowns, which means that we
will be able to determine the phases from the probability 
distributions. What happens is that, while the modulus 
is the probability distribution of one measurement, the 
phases describe the probability distributions of the in- 
compatible measurements. What exactly each phase is, 
we cannot say in general: it depends on the actual matrix 
transformation. But the overall point remains valid: the 
vector state is actually two probabilities combined. 
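
A small sketch can make the "two distributions in one" idea tangible. It assumes one particular complex matrix relating A and B, the discrete Fourier matrix, chosen here only because all its entries have modulus $\sqrt{1/n}$ and it is unitary; the original text does not single out any specific matrix. Two vectors with the same uniform moduli in A but different phases are shown to give different distributions in B.

```python
import cmath
import math


def fourier_matrix(n):
    """n x n matrix with entries exp(2*pi*i*j*k/n)/sqrt(n): all moduli 1/sqrt(n)."""
    return [
        [cmath.exp(2j * math.pi * j * k / n) / math.sqrt(n) for k in range(n)]
        for j in range(n)
    ]


def is_unitary(matrix, tol=1e-9):
    """Check that the rows are orthonormal under the complex scalar product."""
    n = len(matrix)
    for i in range(n):
        for j in range(n):
            inner = sum(matrix[i][k] * matrix[j][k].conjugate() for k in range(n))
            if abs(inner - (1 if i == j else 0)) > tol:
                return False
    return True


def apply(matrix, vector):
    """Matrix times column vector."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def distribution(vector):
    """Probabilities are the components times their complex conjugates."""
    return [(x * x.conjugate()).real for x in vector]


n = 3
U = fourier_matrix(n)

# Two A-basis vectors with identical moduli (uniform in A) but different phases.
flat = [complex(1 / math.sqrt(n))] * n
twisted = [cmath.exp(-2j * math.pi * k / n) / math.sqrt(n) for k in range(n)]

b_from_flat = distribution(apply(U, flat))
b_from_twisted = distribution(apply(U, twisted))

if __name__ == "__main__":
    print([round(p, 3) for p in distribution(flat)])
    print([round(p, 3) for p in b_from_flat])
    print([round(p, 3) for p in b_from_twisted])
```

Both vectors describe a completely unknown A, yet the first makes one B outcome certain and the second makes a different B outcome certain: the information about B is carried entirely by the phases.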

This is an extremely useful property to have. Physi- 
cally it will never make sense to operate on a probability 
distribution without operating on the distributions of all 
incompatible measurements. This is why we constructed 
the space this way. By having the two distributions so
combined, when we write an equation involving one we 
are actually writing an equation involving the other, too. 
The careful reader will have recognized that in the ba- 
sis of the position measurement, this is nothing other 
than the wave function. The squared modulus is the probability
of measuring the system at a given position, while the
phase will give us information about the maximally
incompatible measurement, its momentum. While making
the relationship more precise is outside of the scope of
this work, we want to underline how "wave function" is
actually a poor name, as it is too close to the wave
interpretation. If we could rename it, it would be something
along the lines of "multi-probability distribution".

In any case, we have finally arrived at the first math- 
ematical postulate: the phase space is a complex vector 
space. If we need to represent a continuous measurement, 
such as position or momentum, we will make the number 
of measurements go to infinity, and the distance between 
them go to zero, in which case the components of our 
state will be represented by a complex function. We
have also deduced the second postulate: the probability
of a transition between two normalized states is given by
the square modulus of the inner product



$$P(\psi \to \phi) = |\langle \phi | \psi \rangle|^2 \tag{3}$$
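
The rule that a transition probability is the square modulus of the inner product of two normalized states can be tried on small vectors. This is a sketch under stated assumptions: the function names and the two-component example states are hypothetical and serve only as illustration.

```python
import math


def inner(phi, psi):
    """Inner product of two component lists, conjugating the first one."""
    return sum(complex(a).conjugate() * b for a, b in zip(phi, psi))


def normalize(components):
    """Rescale the components so that the norm is one."""
    norm = math.sqrt(sum(abs(c) ** 2 for c in components))
    return [c / norm for c in components]


def transition_probability(phi, psi):
    """Square modulus of the inner product of the normalized states."""
    return abs(inner(normalize(phi), normalize(psi))) ** 2


up = [1, 0]
down = [0, 1]
mixed = [1, 1j]                      # equal moduli, quarter-turn relative phase
rephased = [1j * c for c in mixed]   # same state times an overall constant

print(transition_probability(up, up))
print(transition_probability(up, down))
print(transition_probability(up, mixed))
print(transition_probability(mixed, rephased))
```

Multiplying a state by an overall constant, as in `rephased`, leaves every transition probability unchanged, which is consistent with such vectors representing the same physical state.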



XI. MEASUREMENT REPRESENTATION 

The last thing we need to do is define how to describe 
the measurement itself. Ideally, we want to be able to 
keep track of all possible final states and of all the pos- 
sible values of the measurement. Since our description 
is statistical, we will also need some way to work with 
expectation values. 

There is a very convenient way to describe all of these 
by using the fact that we can multiply a state vector by 
a constant, and it still represents the same physical state.
The overall idea is to multiply states by the measure- 
ment value, so that the metric of the vector is not only 
the probability, but the probability times the measure- 
ment value, which will give us the expectation value. For 
every measurement, then, we will have a corresponding 
operator that changes the vector in phase space. But we 
have to be careful to avoid misconceptions: this oper- 
ation does not represent the state change that happens 
during measurement. What it does represent is the transformation
from the vector that represents the probability
distribution to a vector that represents the expectation 
values. Let us see how this works in detail. 



Note that, up to now, we have avoided the use of Dirac notation:
this was deliberately done. We believe that, by using the familiar
vector space notation as much as possible, the quantum phase
space is introduced in a more familiar way.




1. state vector
2. state components
3. components multiplied by respective measurement values
4. measurement vector - points to the state where expectation times square root of probability is maximum
5. measurement component on the state vector is state vector times expectation value

FIG. 4: The measurement operator multiplies each state by
the expectation value of the measurement.



We can start from the simplest case, which is a vector 
that represents one of the final states of the measure- 
ments. In this case, we will simply multiply by the asso- 
ciated measurement value. Since the norm of the state 
vector is one, the product of the measurement vector and 
the state vector will be the measurement value. 



$$A|\psi_i\rangle = a_i |\psi_i\rangle \tag{4a}$$
$$\langle \psi_i | A | \psi_i \rangle = a_i \tag{4b}$$



Mathematically, this will mean that the final states of a 
measurement are represented by the eigenvectors of the 
operator: an eigenvector is a vector that the operator 
changes only in the norm, and not in the direction. The 
measurement value, then, will be the eigenvalue associ- 
ated with that eigenvector. 

Since the expectation is a linear operator, we will re- 
quire that the operators associated with our measure- 
ments will also be linear. This means that for a vector 
that is not a final state, each component along the direc- 
tions of the final states will be multiplied by the eigen- 
value associated with that state. Since the value will 
be different along each direction, a state that is not a 
final state will point in a different direction. So, only 
the final states associated with the measurement will be 
eigenstates.
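
The claim that only the final states keep their direction can be illustrated with a two-outcome measurement written in the basis of its own final states. This is a minimal sketch: the measurement values in `VALUES` and the two example states are hypothetical, and the helper names are assumptions.

```python
def apply_measurement(values, components):
    """Multiply the component along each final state by that state's value."""
    return [a * c for a, c in zip(values, components)]


def same_direction(u, v, tol=1e-12):
    """True when u and v are proportional (all 2x2 minors vanish)."""
    n = len(u)
    return all(abs(u[i] * v[j] - u[j] * v[i]) <= tol
               for i in range(n) for j in range(i + 1, n))


VALUES = [1.0, -1.0]      # hypothetical measurement values, one per final state

final_state = [1.0, 0.0]  # a final state of the measurement
mixed_state = [0.6, 0.8]  # not a final state (probabilities 0.36 and 0.64)

scaled_final = apply_measurement(VALUES, final_state)
scaled_mixed = apply_measurement(VALUES, mixed_state)

print(same_direction(final_state, scaled_final))
print(same_direction(mixed_state, scaled_mixed))
```

The first check succeeds and the second fails: the final state is only rescaled, while the mixed state is turned toward a different direction because its two components are multiplied by different values.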

What is even more interesting is what happens to the 
norm of the vector. We know that the state vector is the 
sum of all outcomes weighted according to their probabilities



$$|\psi\rangle = c_0 |\psi_0\rangle + c_1 |\psi_1\rangle + \ldots \tag{5}$$



To be absolutely clear: the fact that the final states and only the
final states are eigenstates of the measurement operator does not
represent the fact that the state is not changed by the measurement.
The two statements are both true, but they are unrelated.
The first statement is represented mathematically by the fact
that the final states are orthogonal, while the second represents
the physical fact that the final states are associated with a well
defined measurement and not with an expectation value.

The relationship between the weights and the probabilities is
slightly looser: to be precise, each weight is a complex number
that, multiplied by its complex conjugate, returns the probability.
When we apply our measurement operator, what we have
are all the possible measurement values weighted according
to their probability.

$$A|\psi\rangle = a_0 c_0 |\psi_0\rangle + a_1 c_1 |\psi_1\rangle + \ldots \tag{6}$$

If we take the inner product with the original state vector, 
we will end up with the expectation value 

$$\langle \psi | A | \psi \rangle = a_0 p_0 + a_1 p_1 + \ldots \tag{7}$$
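
The step from the operator to the expectation value can be verified with a short numerical check. This is a sketch only: the values, probabilities and phases below are hypothetical numbers chosen for illustration, and the helper names are assumptions.

```python
import cmath
import math


def apply_measurement(values, components):
    """Multiply the component along each final state by that state's value."""
    return [a * c for a, c in zip(values, components)]


def inner(phi, psi):
    """Inner product of two component lists, conjugating the first one."""
    return sum(complex(a).conjugate() * b for a, b in zip(phi, psi))


VALUES = [1.0, -1.0]   # hypothetical measurement values a_0, a_1
PROBS = [0.2, 0.8]     # hypothetical probabilities p_0, p_1
PHASES = [0.3, 1.7]    # arbitrary phases; they drop out of the result

# Each component has modulus sqrt(p_i), so c_i times its conjugate is p_i.
state = [math.sqrt(p) * cmath.exp(1j * t) for p, t in zip(PROBS, PHASES)]

via_operator = inner(state, apply_measurement(VALUES, state)).real
via_weighted_sum = sum(a * p for a, p in zip(VALUES, PROBS))

print(via_operator, via_weighted_sum)
```

Both numbers agree up to rounding, whatever phases are chosen, because each term of the inner product is a measurement value times a component multiplied by its own complex conjugate.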

This is indeed a powerful way to represent measure- 
ments. There are other nice properties the operators 
share, which come into play when using them in equa- 
tions, but we will not digress since our point here was 
simply to reach the third mathematical postulate, which 
we did. 



XII. CONCLUSION 

We have seen how we can derive quantum mechanics
from the simple physical postulate that all measurements
are discrete interactions: this is the main difference
introduced by this theory. The fact that there are
incompatible measurements, that our state is only defined up
to a set of compatible ones, that our predictions will be 
probabilistic, that we need a complex vector space as our 
phase space and that we associate operators with each 
measurement are all consequences. All the rest that can 
be derived from these, such as the uncertainty principle 
or interference, will be consequences as well. 



We believe that presenting quantum mechanics in this 
way helps develop a more intuitive physical understand- 
ing and makes it feel less arbitrary and more necessary. 
We saw that the probabilistic nature of our predictions
is basically linked to our inability to measure, and therefore
distinguish between, states that differ from each
other only by incompatible measurements. We saw that 
quantum probability is very different from classical prob- 
ability, because while the latter deals with uncertainty 
coming from lack of measurement, the former deals with 
uncertainty coming from impossibility of measurement. 
The discussion on the two-direction spin system is very 
instructive because it allows us to visualize the phase 
space very clearly, which makes it easier then to gener- 
alize to more complex systems. And finally we saw how 
the operators associated with measurements are actually 
not describing a transformation of the state, as a rotation 
or a translation would, but they are merely multiplying 
the state by the measurement values. These are all small 
but important points that help us sharpen what physics 
we are describing and how it is represented in our math. 



We do not think this is the whole story, though. Just as
c is a critical constant in special relativity, h is one in
quantum mechanics, and we would expect it to be mentioned
in the physical postulates. This is probably a hint that
we are still missing a piece. 

Nonetheless, we believe that this derivation is a good 
step forward in our understanding of quantum mechanics 
and a useful tool when introducing the subject. 